Best Image Generation Video AI Tools & Models - Premium Image Generation Video News

AI News

Aliyun's HappyHorse Goes Viral! Chinese Online Quickly Enters the Market

Alibaba's ATH Innovation Division has launched HappyHorse, a new multimodal video generation model that has now entered a phased testing rollout. The model has performed strongly on Arena.ai's three core leaderboards (text-to-video, image-to-video, and video editing). It offers cinematic quality and deep semantic understanding, supports 1080p high-definition output, and accurately handles a range of visual styles, such as Hong Kong-style atmosphere and classical costumes, making it a strong competitor in the global AI video field.

12.6k 54 minutes ago

ComfyUI Completes $30 Million Financing: Valuation Reaches $500 Million, User Base Exceeds 4 Million

ComfyUI, an AI startup that grew out of an open-source project, announced on April 24 that it had completed a $30 million financing round at a $500 million valuation. The round was led by Craft Ventures, with participation from Pace Capital and others. Its core product is a node-based workflow platform whose modular framework addresses the lack of precise control in mainstream diffusion models when generating images, video, and audio. Unlike prompt-driven platforms such as Midjourney, it lets users finely adjust every step of the generation process.

10.3k 14 minutes ago

Doushen Education and Microsoft Azure Collaborate to Create an AI Short Drama Platform

At the annual Microsoft AI Tour event, Doushen Education launched the new 'Doushen AI Short Drama Platform,' built on a multimodal AI architecture that integrates text comprehension, image generation, video generation, and intelligent dubbing. It covers scriptwriting, storyboard breakdown, and character setting, marking a major breakthrough in AI-driven content creation...

12.6k 2 hours ago

GPT-5.5 Suddenly Appears: Is the Era of OpenAI Intelligent Agents Coming Earlier Than Expected?

On the same day that OpenAI released the ChatGPT Images 2.0 image generation tool, a mysterious model named "GPT-5.5" unexpectedly appeared in the development environment, causing a stir in the developer community. Several users spotted the model in the Codex CLI terminal interface, and Reddit user DavidAGMM confirmed the leak in a video.

36.4k just now

AI Products

GPT Image 2 App

A next-generation high-performance AI image and video generation platform, supporting powerful text rendering and 4K high-resolution output.

Image generation

Zorq AI

A powerful AI image and video generation platform. Advanced technology helps you quickly create stunning visual works.

Image generation

7.7k

MiaDance

MiaDance is an AI video and image generation platform that can quickly create content from text or images.

Video generation

5.7k

TikTomato

A one-stop AI image and video generation platform with over 20 models, no prompts required, and pay-as-you-go pricing.

Image generation

5.4k

Models

Model | Provider | Input price per M tokens | Output price per M tokens | Context Length
(- = not listed)

Gemini 2.0 Flash-Lite | Google | $0.49 | $2.1 | -
GPT-4.1 mini | OpenAI | $2.8 | $11.2 | -
Grok 4 Fast | xAI | $1.4 | $3.5 | -
o3-mini | OpenAI | $7.7 | $30.8 | 200
GPT-5 Codex | OpenAI | - | - | -
Claude 3 Opus | Anthropic | $105 | $525 | 200
Gemini 2.0 Flash | Google | $0.7 | $2.8 | -
Claude Haiku 4.5 | Anthropic | - | $35 | 200
Gemini 2.5 Flash | Google | $2.1 | $17.5 | -
Claude Sonnet 4.5 | Anthropic | $21 | $105 | 200
Claude 3 Sonnet | Anthropic | $21 | $105 | 200
Gemini 2.5 Flash-Lite | Google | $0.7 | $2.8 | -
qwen3-coder-plus | Alibaba | - | $16 | -
Qianfan-Lightning | Baidu | - | - | 128
wan2.5-i2i-preview | Alibaba | - | - | -
qwen3-max | Alibaba | - | $24 | 256
qwen3-vl-plus | Alibaba | - | $10 | 256
qwen-image-plus | Alibaba | - | - | -
qwen-image-edit | Alibaba | - | - | -
Doubao-Seed-Translation | ByteDance | $1.2 | $3.6 | -
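
Because the prices above are quoted per million tokens, the cost of a single request is the token count multiplied by the price, divided by one million, summed over input and output. The sketch below illustrates that arithmetic using the figures listed for Gemini 2.0 Flash-Lite exactly as they appear on this page. The function name `estimate_cost` and the token counts are hypothetical and chosen only for illustration; the prices are taken as listed and have not been checked against the provider's own price sheet.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Estimate the cost of one request from per-million-token prices.

    Prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_price_per_m) / MILLION
    output_cost = Decimal(output_tokens) * Decimal(output_price_per_m) / MILLION
    return input_cost + output_cost


# Listed figures for Gemini 2.0 Flash-Lite: 0.49 input, 2.1 output (per M tokens).
# The token counts (120,000 in, 8,000 out) are made up for this example.
cost = estimate_cost(120_000, 8_000, "0.49", "2.1")
print(f"Estimated cost: {cost}")
```

Decimal arithmetic is used here rather than floats so that small per-token amounts do not pick up binary rounding noise. Models with a missing input or output price in the listing cannot be estimated this way.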

MCP

MiniMax MCP Server

The official MiniMax Model Context Protocol (MCP) server supports interaction with text-to-speech and video/image generation APIs and works with client tools such as Claude Desktop and Cursor.

python

55.3k

4.8 points

MiniMax

Verified

MiniMax's official Model Context Protocol (MCP) server supports interaction with APIs such as text-to-speech and video/image generation.

python

13.8k

4.0 points

Amazon Bedrock Nova

An SSE-based MCP server providing image and video generation tools

python

9.7k

2.5 points

Media Gen Mcp

Media Gen MCP is a TypeScript server that strictly follows the MCP specification and focuses on generating and editing images and videos with OpenAI and Google AI models. Its tools cover image generation and editing, video creation and remixing, and file fetching and processing, with support for intelligent resource linking and inline output. It works with any MCP-compatible client.

typescript

6.3k

2.5 points

Luma Api Mcp

Luma API MCP is a project that provides image and video generation services. Users access it with an API key. It supports multiple aspect ratios, models, and resolution options, and generation can be guided with reference images or video keyframes.

python

9.5k

2.5 points

MiniMax MCP JS

MiniMax MCP JS is a MiniMax Model Context Protocol toolkit implemented in JavaScript/TypeScript. It provides text-to-speech, image generation, video generation, and voice cloning, and supports multiple configuration methods and transport modes.

typescript

10.2k

2.5 points

RunwayML + Luma AI

A multi-functional MCP server integrating RunwayML and Luma AI APIs, supporting video/image generation and processing tasks.

typescript

9.1k

2.5 points

Ghibli Mcp Video Server

A TypeScript-based MCP server that provides AI image and video generation functions and requires an API key from GPT4O Image Generator.

typescript

2.5 points

Mcp Kling

MCP Kling is the first and only complete Kling AI MCP server. It offers 13 creative tools covering video generation, image processing, lip-syncing, and virtual try-on, and integrates seamlessly with Claude, making it suitable for content creators and developers.

typescript

9.4k

2.5 points

Gemini Mcp

This is an MCP server based on Google's Gemini API, providing text conversation, image generation, and video generation functions. It can be used as an alternative to Codex MCP.

typescript

8.3k

2.5 points

MiniMax Multimodal

MiniMax MCP JS is a MiniMax MCP toolset implemented in JavaScript/TypeScript. It provides image generation, video generation, and text-to-speech, and supports interaction with MCP-compatible clients.

typescript

10.3k

2.5 points

Fal Mcp Server

The AI Video Generation MCP Server supports text and image input to generate dynamic videos, providing multiple parameter controls and model selections.

typescript

10.7k

2.5 points

Ghibli Video Generator

A TypeScript-based MCP server that provides AI image and video generation functions and requires an API key from GPT4O Image Generator.

typescript

8.7k

2.5 points

Fal Image Video Mcp

The FAL Image and Video MCP Server is a high-performance MCP server designed specifically for image and video generation with FAL AI, with automatic download to the local machine. It provides public URLs, data URLs, and local file paths, and is suitable for MCP-compatible clients such as Claude.

typescript

9.1k

2.5 points

Gemini with Web Search

The MCP Gemini API server is a Google Gemini API proxy service designed for Cursor and Claude, providing functions such as text generation, image analysis, video analysis, and web search.

typescript

11.1k

2.5 points

Runway Api Mcp Server

This is an MCP server project based on the Runway API, allowing users to call various AI generation functions of Runway through Claude Desktop, including tools such as video generation, image generation, video editing, and upscaling.

typescript

5.6k

2.5 points

Mcp Veo2

This project is a video generation MCP server based on the Google Veo2 model. It supports video generation through text prompts or images and provides MCP resource access functions.

typescript

10.2k

2.5 points

Vidu Mcp Server

The Vidu MCP Server is a server based on the Model Context Protocol, used to interact with the Vidu video generation API, providing functions such as image-to-video conversion, generation status check, and image upload.

typescript

10.2k

2.5 points

MiniMax MCP

MiniMax-MCP is a multi-functional server project that provides API services such as text-to-speech, video generation, and image generation, helping developers integrate advanced multimedia features.

python

9.5k

2.5 points

Luma Ai Mcp Server

Luma AI's MCP Server provides functions such as text/image-to-video generation, video enhancement, and creative content management through the Dream Machine API.

python

10.8k

2.0 points

Models

HunyuanVideo 1.5_I2V_720p GGUF

HunyuanVideo 1.5_I2V_480p GGUF

Chronoedit

HunyuanVideo 1.5

Sam3

LongCat Video

Qwen3 VL 32B Thinking AWQ

WAN2.2 I2V_A14B DISTILL LIGHTX2V 4STEP GGUF

Ovi

Wan2.1 HuMo GGUF

Wan2_1 HuMo_17B GGUF

Wan2.2 VACE Fun A14B

Wan2.2 S2V 14B

Show O2 7B

Show O2 1.5B

Qwen2.5 Omni 3B GGUF

Ltxv 13b 0.9.7 Distilled GGUF

HunyuanCustom Gguf

Ming Lite Omni

SkyReels V2 DF 14B 540P GGUF
